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Abstract 

In this paper, we introduce the Gompertz power series (GPS) class of distributions 
which is obtained by compounding Gompertz and power series distributions. This distribu¬ 
tion contains several lifetime models such as Gompertz-geometric (GG), Gompertz-Poisson 
(GP), Gompertz-binomial (GB), and Gompertz-logarithmic (GL) distributions as special 
cases. Sub-models of the GPS distribution are studied in details. The hazard rate function 
of the GPS distribution can be increasing, decreasing, and bathtub-shaped. We obtain sev¬ 
eral properties of the GPS distribution such as its probability density function, and failure 
rate function. Shannon entropy, mean residual life function, quantiles and moments. The 
maximum likelihood estimation procedure via a EM-algorithm is presented, and simulation 
studies are performed for evaluation of this estimation for complete data, and the MLE of 
parameters for censored data. At the end, a real example is given. 

Keywords: EM algorithm; Gompertz distribution; Maximum likelihood estimation; Power 
series distributions. 

1 Introduction 

The exponential distribution is commonly used in many applied problems, particularly in life¬ 
time data analysis. A generalization of this distribution is the Gompertz distribution. It is 
a lifetime distribution and is often applied to describe the distribution of adult life spans by 
actuaries and demographers. In some sciences such as biology, gerontology, computer, and 
marketing science, the Gompertz distribution is considered for the analysis of survival. 

A random variable X is said to have a Gompertz distribution, denoted by X G/3,7), if 
its cumulative distribution function (cdf) is 

/3 (^'yx 1 "v 

G{x) = 1 — e 'I'i % X > 0, /3 > 0, 7 > 0, 
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( 1 . 1 ) 



and the probability density function (pdf) is 


g{x) = I3e^^e 


( 1 . 2 ) 


The Gompertz distribution is a flexible distribution that can be skewed to the right and to 
the left. The hazard rate function of Gompertz distribution is hg{x) = /3e'^^ which is a increasing 
function. The exponential distribution can be derived from the Gompertz distribution when 
7 0+. 

Also, a discrete random variable, is a member of power series distributions (truncated 
at zero) if its probability mass function is given by 


p[N = n) = -^^, n = l,2,..., 


(1.3) 


where an > 0, C{9) = ^ anO^, and 9 G (0, s) is chosen such that C{9) is finite and its 

n=l 

first, second and third derivatives are defined and shown 


’’power series distribution” is generally credited to 


Noack 


b y C'( X C"{.) and C'"(.). The term 
(119501 1. This family of distributions 


includes many of the most common distributions, including the binomial, Poisson, geometric, 
negative binomial, lo garithmic distributions. For more details of power series distributions, see 


Johnson et al. 


(120051), page 75. 


In this paper, we compound the Gompertz and power series distributions and introduce a 



The remainder of our paper is organized as follows: in Section [2l we give the density and 
failure rate functions of the GPS distribution. Some properties such as quantiles, moments, 
order statistics, Shannon entropy and mean residual life are given in Section [3l Special cases 
of GPS distribution are given in Section 01 We discuss estimation by maximum likelihood 
and provide an expression for Fisher’s information matrix in Section l5l In this Section, we 






































present the estimation based on EM-algorithm, and Section [ 6 ] contains Monte Carlo simulation 
results on the finite sample behavior of these estimators. In this Section, we also investigate 
the properties of MLE of parameters when the data are censored. An application of GPS 
distribution is given in the Section [71 


2 The Gompertz-power series model 


The GPS model is derived as follows. Let be a random variable denoting the number of 
failure causes which it is a member of power series distributions (truncated at zero). For given 
N, let Xi,X 2 , ■■■,Xn be independent identically distributed random variables from Gompertz 
distribution. If we consider = min(Xi,..., Xjv), then \ N = n has Gompertz dis¬ 
tribution with parameters nj3 and 7 . Therefore, the GPS class of distributions, denoted by 
GPS{I3,^,9), is defined by 


F{x) = 1 - 


C{e-eG{x)) 

W) 


= 1 - 


G{ee 




c{e) 


X > 0. 


( 2 . 1 ) 


The pdf of GPS{(3, 7 , 6) is given by 






( 2 . 2 ) 


Proposition 1. If G{0) = 9, then the Gompertz distribution function concludes from the GPS 
distribution function in m- Therefore, the Gompertz distribution is a special case of GPS 
distribution. 


Proposition 2. The limiting distribution of GPS{f3,'y,9) when 0 ^ O'*' is 


lim F{x) = 1 — e T 
which is a G{c/3,'y), where c = min{n G N : On > 0 }. 

Proposition 3. The limiting distribution of GPS{f3,'y,9) when 7 —>• 0'*' is 

, , G(9e-f^^) 

FO) = 1 - 

In fact, it is the cdf of the ex ponential-power series (EPS) distribution and is introduced by 
Chahkandi and Ganiali A200yi). This distribution c ontains several distri butions; geometric- 


exponential distribution lAdamidis and Louka. 


199H: 


Adamidis et al. 


200 a) . Poisson-exponential 






























distribution /iKua.\200il) . and logarithmic-exponential distribution iTahmasbi and Rezaei.\200a) . 
Therefore, the GPS distribution is a generalization of EPS distribution. Note that EPS distri¬ 
bution is a distribution family with decreasing failure rate (hazard rate). 

Proposition 4. The densities of GPS class can be expressed as infinite linear combination of 
density of order distribution, i.e. it can be written as 


f{x) = '^P{N = n) g(i){x;n), (2.3) 

n=\ 

where ( 7 (i)(x;n) is the pdf ofYf^i^ = min(yi, 12 , •••) ^); given by 
9 {i)ix-,n) = ng{x)[l - = njde'^^ 

i.e. Gompertz distribution with parameters nft and 7 . 

Proposition 5. The survival function and the hazard rate function of the GPS class of distri¬ 
butions, are given respectively by 


Six) = 


CiOe 


-£(e7.-l) 


CiO) 


hix) = 




CiOe 




(2.4) 


Proposition 6 . Por the pdf in (12.2p we have 


lim fix) = = l3EiN), lim fix) = 0 . 

a;->' 0 + G[o) x^+00 

Proposition 7. Eor the hazard rate function, hix), in (12.41) we have 

(39C'i9) 


lim hix) = lim fix) = , ..... 

a;->-0+ a;->-0+ C^(^) x^+oo 


lim hix) = + 00 . 


Consider C (0) = 0 + 0^^. Therefore, the pdf of GPS distribution is given as 

/ (x) = /3e^"e-f + 2O0i9e-^(^^^-i))(l + 9 ^^)-\ 

The plots of this density and its hazard rate function, for some parameters are given in Figure 
m For (5 = 0 . 1,7 = 3,9 = 1.0, this density is bimodal, and the values of modes are 0.1582 and 
1.1505. 


3 Statistical properties 

In this section, some properties of the GPS distribution, such as quantiles, moments, order 
statistics. Shannon entropy and mean residual life are obtained. 
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Figure 1: Plots of pdf and hazard rate functions of GPS with C {9) = 6 -\- 6“^^. 


3.1 Quantiles and Moments 


The quantile q of GPS distribution is given by 

x, = G-^ (l-Jc’-i((l-g)C'(0))), 


0 < g < 1, 


where G~^{y) = ^ log ^ log(l — and C'“^(.) is the inverse function of C{.). This result 
helps in simulating data from the GPS distribution with generating uniform distribution data. 

For checking the consistency of the simulating data set form GPS distribution, the his¬ 
togram for a generated data set with size 100 and the exact GPS density with G {9) = 9 + 0^*^, 
and parameters (5 = 0.1, 7 = 3, 6 * = 1.0, are displayed in Figure [2] (left). Also, the empirical 
distribution function and the exact distribution function are given in Figure [2] (right). 

Now, we obtain the moment generating function of the GPS distribution by its Laplace 
transform. Gonsider X ~ GPS{l3,'y,9). Then, the Laplace transform of the GPS class can be 
expressed as 

00 

L{s) = E{e-^^) = Y^P{N = n)Li(s), (3.1) 

n=l 


where 


n/3 Hi nX 
Li[s) = —e T- W ±{—), 

^ 'y 'y 


is the Laplace t ransform of G ompertz distribution with parameters nl3 and 7 , and Wf{z) = 

pCO g 7 / 

Ji (see 


Lenartl . 


2 OI 2 I ). Therefore, the moment generating function of the GPS distri¬ 


bution is 


Mx(t) = '^P(N = n)Li(-t) 

n=l 


7 


E 

n=l 


an9^ Hi n(3 


0 N 0 

>EE[Ne—W^{-^)]. (3.2) 

7 77 
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Figure 2: The histogram of a generated data set with size 100 and the exact GPS density (left) 
and the empirical distribution function and exact distribution function (right). 


We can use Mx{t) to obtain the central moment functions, = E\X^]. But from the 
direct calculation, we have 

r+cxD 


/ + CXD 

x^f{x)dx = Y,nN = n)E[Yl^^] 

n=l 


(3.3) 


where E[Y^ 
7 , given by 


is the rth moment of T(i), the Gompertz distribution with parameters n/3 and 


Lenartl (|2012li as 


r! "/3 1 nB 

E[Y[,^] = -eYwr\^), 


(3.4) 


r 7 

where = Ty^jrr /^°°(lnx)'’“^^-^(ix is the generalised integro-exponential function. See 


Lenart 


2012l b for some expressions and approximations about the expected value and variance 


of Gompertz distribution. For example, when (3 is close to 0, an approximate result for 
is 


0.57722). 

^ >y --y 'y 


(3.5) 


3.2 Order statistic 


Let Xi, X 2 , ■■■, Xn be a random sample of size n from GPS{f3,'y,9), then the pdf of the ith 
order statistic, say W:n, is given by 


fi-.nix) — 


n\ 


{i — l)!(n — i)l 


fix)[l - 




c{0) 


C{0) 


































where /(.) is the pdf given by (j2.2p . Also, the cdf of Xi-n is given by 


Pi'.n (^) — /. 


n\ 


[i — l)!(n — i) 


n-i (- 1 )^ 

irrE- 


n — ^ 
k 


k=0 


k + 1 


[ 1 - 


C{ee 




]k+i 


0(9) 


An analytical expression for rth moment of order statistics is obtained as 


mu = E 

k=n—i-\-l 


k—n+i—1 


k — 1\ f n 
k 


E 

k=n—i-\-l 


n — I 

n — i 


r* + 00 


^S(x)^dx 


[C(e)f 

3.3 Shannon entropy and mean residual life 


r+oo 

Jo ' 


x^-^[C(ee 




)fdx. (3.6) 


If A is a none- negative c o ntinu ous random variable with pdf f(x), then Shannon’s entropy of 


X is dehned by 


ShannonI ( 19481 ) as 


Hif) = E[-logfiX)] = - 


r+oo 


f(x)ln{f(x))dx, 


and this is usually referred to as the continuous entropy (or differential entropy). An explicit 
expression of Shannon entropy for GPS distribution is obtained as 


H(f) = - log(0/3) - y/xi - ^ + ^Mx(i) + logic(9)) - ExlAiN, 9)], 

7 7 


(3.7) 


where A{N,9) = Nu^ ^ \og(C'(9u))du. Also, the mean residual life function of X is given 


by 


m(t) = E[X - t\X >t] = 


j;°^{x-t)fix)dx _ Ci9)Ex[Bit,N,ld,-f)] 


Sit) 


Ci9e 




- t, 


where Bit, N, P,^) = J) °^Nj3xe^^e t 


+0O 


dx. 


4 Special cases of the GPS distributions 

In this Section, we consider four special cases of the GPS distribution. 


4.1 Gompertz - geometric distribution 


The geometric distribution (truncated at zero) is a special case of power series distributions with 
a„ = 1 and 0(9) = (0 < 9 < 1) . The pdf and hazard rate function of Gompertz-geometric 

(GG) distribution is given respectively by 


fix) 


il — 9)f3e^^e 
(l_0e-f(e--l))2 


(4.1) 
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Figure 3: Plots of density and hazard rate functions of GG for different values /3, 7 and 6*. 


and 


h{x) 




(4.2) 


Remark 4.1. When 6* = 1 — 6, from (14.11) we have 

fix) = 


e*(5e^^e~W^~^^ 




(4.3) 


Based on 


Marshall and Olkin \l997 ) f{x) in (14.3D also is density for all 9* > 0 (9 < 1). 


Note that when 7 —>• O"'', the pdf of extended exponential geometric (EEG) distribution (see 


Adamidis et al. 


200 . 


concludes from the pdf in dOI) with 9* > 0. The EEG hazard function 


is monotonically increasing for 9* > 1; decreasing for 0 < 0* < 1 and constant for 9* = 1. 


Remark 4.2. If 9* = 1, then the pdf in (14.3p becomes the pdf of Gompertz distribution. Note 
that the hazard rate function of Gompertz distribution is increasing. 


The plots of density and hazard rate function of GG distribution for different values of (3, 
7 and 9* are given in Figure [3l We can see that the hazard rate function of GG distribution is 
increasing or bathtub. 
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Figure 4: Plots of density and hazard rate functions of GP for different values /3, 7 and 9. 


4.2 Gompertz - Poisson distribution 


The Poisson distribution (truncated at zero) is a special case of power series distributions with 
dn = ^ and C{9) = — 1 {9 > 0). The pdf and hazard rate function of Gompertz-Poisson 

(GP) distribution are given respectively by 


f{x) 


e® — 1 


(4.4) 


and 


h{x) 


9l3e^^e 


1 - 




(4.5) 


The plots of density and hazard rate function of GP for different values of /3, 7 and 9 are 
given in Figure 01 We can see that the hazard rate function of GP distribution is increasing or 
bathtub. 


4.3 Gompertz - binomial distribution 


The binomial distribution (truncated at zero) is a special case of power series distributions with 


Ojr> — 


m 


and C{9) = {9 + I)™' — 1 (0 > 0), where m {n < m) is the number of replicas. The 


n 































Figure 5: Plots of density and hazard rate functions of GB for m = 5, and different values (3, 
7 and 0. 


pdf and hazard rate function of Gompertz - binomial (GB) distribution are given respectively 
by 


fix) 


( 6 » + 1 )”^ - 1 ' 


(4.6) 


and 


h{x) 


m9f3e'^^e r 




+ 1 ) 


m—1 


{9e + 1)”^ - 1 


(4.7) 


The plots of density and hazard rate function of GB for m = 5, and different values of /?, 
7 and 9 are given in Figure [5j We can see that the hazard rate function of GB distribution is 
increasing or bathtub. We can find that the GP distribution can be obtained as limiting of GB 
distribution if m9 —A > 0, when m —> oo. 


4.4 Gompertz - logarithmic distribution 

The logarithmic distribution (truncated at zero) is also a special case of power series distribu¬ 
tions with On = ^ and C{9) = — log(l — 9) (0 < 0 < 1). The pdf and hazard rate function of 
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Figure 6 : Plots of density and hazard rate functions of GL for different values /3, 7 and 9. 


Gompertz - logarithmic (GL) distribution are given respectively by 


fix) 




l(e-y--i) 


(9e — 1) log(l — 0) 


and 


(4.8) 


h{x) 




{9e — 1) log(l — 0e ^^) 


(4.9) 


The plots of density and hazard rate function of GL for different values of (3, 7 and 9 are 
given in Figure [ 6 l We can see that the hazard rate function of GL distribution is increasing or 
bathtub. 


5 Estimation and inference 

In this Section, we will derive the maximum likelihood estimators (MLE) of the unknown 
parameters 0 = (/ 3 , 7 ,0)^ of the GPiS'(/3, 7 , 0). Also, asymptotic confidence intervals of these 
parameters will be derived based on the Fisher information. At the end, we will propose an 
Expectation - Maximization (EM) algorithm for estimating the parameters. 



























5.1 MLE for parameters 

Let Xi, ...,Xn be a random sample from GPS{/3,^,9), and let x = (xi, .■.,Xn) be the observed 
values of this random sample. The log-likelihood function is given by 

n n 

In = = nlog( 6 ») + nlog{/3) nyx-t ^ log(L) -k ^ log(C"( 6 »ti)) - nlog(C'( 6 »)), 

i=l i=l 

where ti = e Therefore, the score function is given by U{&;x) = 

where 


din 

dp 


n 1 


n n ^ 

—1 ^ ^ — 1 


tj \og{ti)c''{etj) 

cpeti) 


dl 


^ = nx + Y,dr + eY, 


2 = 1 


2=1 


hc'peu) 
cpetp ’ 


din 

d9 


n ^ tiC"{9ti) nC'{9) 
C'{0U) C(^ 


(5.1) 

(5.2) 

(5.3) 


and bi = ^ = pdi and di = = 7 (“ log(ti) - /3xj). 

The MLE of 0, say 0, is obtained by solving the nonlinear system U{@;x) = 0. We 
cannot get an explicit form for this nonlinear system of equations and they can be calculated 
by using a numerical method, like the Newton method or the bisection method. 

For each element of the power series distributions (geometric, Poisson, logarithmic and 
binomial), we have the following theorems for the MLE’s; 


Theorem 5.1. Let gi{P;^,9,x) denote the function on RHS of the expression in \5.1\) . where 
7 and 9 are the true values of the parameters. Then, for a given 7 > 0, and 0 > 0, the roots of 
gi{P;'y,9,x) = 0, lies in the interval 


( n 

9C"{f>) , 1 
'cpdY + 


(-^log(pi)) ^ 

2 = 1 


Proof. See Appendix A.l. 


^(-^log(K)) , 


□ 


Theorem 5.2. Let g 2 {'y; P, 9, x) denote the function on RHS of the expression in 15. Sfl . where 
P and 9 are the true values of the parameters. Then, the equation g 2 ( 7 ; P, 9,x) = 0 has at least 
one root if 


Proof. See Appendix A. 2 . 


□ 











Theorem 5.3. Let g 3 { 6 -, f3,'y,x) denote the function on RHS of the expression in il5.3\) . where 
/3 and 7 are the true values of the parameters. 

a. The equation g 3 ( 6 *;/3, 7 , *) = 0 has at least one root if for all GG, GP and GL distributions 

n 

Eti > f. 

i=l 

b- df P, 7 , x) = where p = and p ^ (0) 1) then the equation {9; f5,'-f,x) = 0 has 

n n 

at least one root for GB distribution if ^ ti > ^ and E F ^ ■ 

i=l i=l 

Proof. See Appendix A.3. □ 

Theorem 5.4. The pdf, /(x|0), of GPS distribution satisfies on the regularity condistions, 

i.e. 


i. the support of f{x\&) does not depend on 0 , 
a. f{x\Q) is twice continuously differentiable with respect to 0 , 


Hi. the differentiation and integration are interchangeable in the sense that 


_d 

d& 


t r°° r°^ d 

-y_^/(xi©Mx=y__^— /(xie)c(x, 


92 


5090 ^ J-c 

Proof. The proof is obvious and for more details, see 


f{x\&)dx = 




l-oc d&d@^ 


f{x\&)dx. 


Casella and Bergen (120011 ) Section 10. □ 


Now, we derive asymptotic confidence intervals f or the parameters o f GPS distribution 


It is well-known that under regularity conditions (see 


Casella and Bergen. 


2001 


, Section 10), 


the asymptotic distribution of v ^(0 — 0 ) is multivariate normal with mean 0 and variance- 
covariance matrix J“^(0), where Jn(0) = lim„_j.o /n(0), and /n(®) is the 3x3 observed 
information matrix, i.e. 

In ( 0 ) = - 

whose elements are given in Appendix B. Therefore, an 100(1 — a) asymptotic confidence 
interval for each parameter, 0 ^, is given by 


Ipfi 

Ipi 

Ipe 

Ip'y 

^11 

I-yd 

Ipe 


he 


AC If — (0r — ■^a/2 Ar? 0r A ^ ^ ^ Irr) ? (^•^) 

where Iff is the (r, r) diagonal element of /“^(0) for r = 1,2,3 and .^^72 is the quantile ^ of 
the standard normal distribution. 


In some cases, a censoring time C* is assumed in collecting the lifetime data Aj, where 
Ci and Xi are independent. Suppose that the data consist of n independent observations 














Xi = min(Xj, Cj) and 5i = I{Xi < Ci) is such that hj = 1 if Xi is a time to event and <5^ = 0 if 
it is right censored for i = 1,..., n. The censored likelihood function is 

n 

Ls{&) cx (5.5) 

i=l 

where f{xi\@) and 5(xj|0) are the density function and survival function of GPS distribution. 
A similar procedure to the above can be used for constructing confidence interval for the 
parameters of the GPS model with a censoring time. 


5.2 EM-algorithm 


The EM algorit 

im is 

Demoster et ah. 

1977) 


im is a very powerful tool in handling the incomplete data problem (see 


19771 ). It is an iterative method, and there are two steps in each iteration: 


Expectation step or the E-step and the Maximization step or the M-step. The EM algorithm 
is especially useful if the complete data set is easy to analyze. In this Section, we develop an 
EM-algorithm for obtaining the MLE’s for the parameters of GPS distribution. 

We define a hypothetical complete-data distribution with a joint probability density func¬ 
tion in the form 


c{e) ’ 


where /3, 7 , 0 > 0, Xj > 0 and Zi G N. Therefore, the log-likelihood for the complete-data is 


l*{y, 0) oc nzlog{6) + nlog(/3) -|- nyx — — Zi{e^^'^ “ 1 ) “ ^log(C'( 0 )), 


^r=i 


(5.6) 


where y = (xi,Xn, zi,Zn), z = n ^ Yl x = n ^ ^ x*. On differentiation (15.61) 

i=l i=l 

with respect to parameters /3, 7 , and 0 , we obtain the components of the score function, 
TTf r^\ /dlt dl* dl*-., 

Uiv, ^) — ( g/3 , ffj J gd ), 


^ - I!: - - i) 

9/3 - /3 7^ 


_ fi ( 'yx- 'yx- 

= nx + — } - 1 )-^ ^ZjXje^ % 

^ i=i ^ i=i 


dj 


90 


nz c'{e) 


Prom a nonlinear system of equations U{y,&) = 0, we obtain the iterative procedure of 







the EM-algorithm as 


^h+i) = 


n7 


it) 


_ git+i) ^ ^ „(t) 

^ ^W(e7W^i _ 1)’ nC'(0(*+i)) ^ * 


2=1 




nx(7(*+i))2 + / 3 W ^ - 1 ) - ^(*+i)/3W ^ = 0, 

2=1 2=1 

where 0 h+i) and 7 ^+ 1 ) are found numerically. Here, for i = 1,2, ...,n, we have that 

0 (t)e-wy^^ 


4*^ = 1 + 


C'( 0 (*) 


e ^ 




In this part, we use the results of 


LouisI (|l982l l to obtain the standard errors of the es¬ 


timators from the EM-algorithm. The elements of the 3x3 observed information matrix 


Ic{@-,y) = - 


_ rW(^i 


d@ 


are given by 


dX _ n dX _ dX 


-.22 1 ^ 


ZiXie 


7*i 


9,02 /32’ 

X]X_X]X_XX_^^^ dX nz nC'je) n{C'{e)f 

’ 902 02 + C{e) (C’(0))2 ’ 

nXi _|_ tL ■y.rr'^e'y^i 

^ ^ 


9/390 909/3 9097 9790 


92/* 2/3 


5 E E + r 7 . 


972 7^ ■ 7 ^ 

Taking the conditional expectation of Ic{&',y) given x, we obtain the 3x3 matrix 


Ic{&-,x) = E{Ic{&-,y)\x) = [cij], 


where 


and 


n 


cii = ^) C 12 = C 21 = — 

^2 


1 72 1 ^ 

^ - 1 ) + -y^EiZi\xi)xie'^^% 

tXI 


ci3 — C 31 — C 23 — C 32 — 0, C 33 — ''^E{Zi\xi) + 


2=1 

nC"{9) n{C'{e)f 


2 = 1 


C22 = - 1) - ^ ^ ^E(Zi|xi)x2e 


2/3 


C(0) (C(0)) 

/? 


2 ’ 


’•W2^.7a;i 


T' i=l 


J 


2=1 


7 


2=1 


E(Zi|xi) = 1 -h 




0 e T 


^C"( 0 e" 


C'iOe 

Moving now to the computation of lm(0; x) as 




= Var[U{y,Q)\x] = [vij], 























where 


and 


^ n 1 ^ 

^ “ l)^yar(Zi|xi), Vi3 = V3i = - - - l)yar(Zilxi), 

^ 2 = 1 


2=1 


V12 = V2i = -^ - l)(e^"^ - 1 - 


7' 


2=1 


V22 = ^ - 1 - 7Xje^^*)^yar(Zj|xi), 


7 


2=1 


^ n ^ n 

V23 = V32 = - 1 - lXie^'^^)Var{Zi\xi), V 33 = ■^'^Var{Zi\xi), 

i=l i=l 


Var{Z\x) = E{Z^\x) - {E{Z\x)f 

1 °° 1 
= cw E ‘ - icwpic'f"-)+ 

= + 39.C"(9.)1 - p7^|C'(«.) + 9.C"(«.)1", 

_ §_ (a'yx _ 1 \ 

in which 0* = 0e t" ^ . Therefore, we obtain the observed information as 

7(0; x) = Xc(0; x) - Xm(0; ®). 

The standard errors of the MLE’s of the EM-algorithm are the square root of the diagonal 
elements of the /“^(0;a;). 


6 Simulation 

This section presents the results of three simulation studies. Eirst, a simulation study is per¬ 
formed for evaluation of parameter estimation based on the EM algorithm. No restriction has 
been imposed on the maximum number of iterations and convergence is assumed when the 
absolute difference between successive estimates are less that 10“^. 

Here, we consider the GG distribution and generate N = 1000 random samples with 
different set of parameters for n = 30,50,100, 200. In each random sample, the estimation of 
parameters as well as the Eisher information matrix are obtained. Then, the average value 
of estimations (AE), mean square errors (MSE), variance of estimations (VS), the average 
value of inverse of Eisher information (EE) matrices, and coverage probabilities (CP) of the 
95% conhdence interval in (|5.4I) are computed. The results are given in Table [TJ and we can 
conclude that 






Table 1; The average MLE’s, mean square errors, variance of estimations, the average value of Fisher information, and coverage probability 
based on EM estimators for GG distribution_ 



Parameter 

AE 

MSE 

VS 

EE 

CP 

n 

/? 

7 

e 

/? 

7 

e 

P 

7 

e 

p 

7 

e 

P 

7 

9 

P 

7 

9 

30 

0.5 

2.0 

0.9 

0.490 

2.760 

0.891 

0.914 

4.601 

0.466 

0.102 

1.553 

0.008 

1.748 

6.111 

0.089 

0.90 

0.94 

0.91 

50 

0.5 

2.0 

0.9 

0.458 

2.582 

0.903 

0.939 

3.538 

0.460 

0.092 

1.084 

0.005 

1.593 

4.836 

0.076 

0.92 

0.94 

0.91 

100 

0.5 

2.0 

0.9 

0.446 

2.451 

0.908 

1.004 

2.796 

0.461 

0.108 

0.666 

0.005 

0.882 

2.723 

0.041 

0.95 

0.95 

0.95 

200 

0.5 

2.0 

0.9 

0.470 

2.283 

0.904 

0.941 

2.170 

0.454 

0.112 

0.442 

0.005 

0.679 

1.716 

0.027 

0.95 

0.95 

0.96 

30 

0.5 

2.0 

0.1 

0.406 

2.711 

0.207 

1.020 

4.774 

0.994 

0.039 

0.745 

0.107 

1.774 

5.318 

5.722 

0.89 

0.96 

0.87 

50 

0.5 

2.0 

0.1 

0.427 

2.587 

0.187 

1.004 

4.149 

1.004 

0.039 

0.531 

0.102 

1.143 

3.119 

3.222 

0.90 

0.94 

0.88 

100 

0.5 

2.0 

0.1 

0.457 

2.418 

0.131 

0.951 

3.371 

1.030 

0.032 

0.311 

0.086 

1.790 

5.490 

6.589 

0.92 

0.96 

0.90 

200 

0.5 

2.0 

0.1 

0.485 

2.300 

0.192 

0.914 

2.948 

1.036 

0.027 

0.211 

0.076 

0.835 

2.103 

2.602 

0.92 

0.95 

0.92 

30 

1.0 

2.0 

0.9 

0.859 

3.441 

0.915 

0.764 

8.083 

0.401 

0.213 

3.115 

0.005 

4.490 

14.220 

0.060 

0.91 

0.93 

0.93 

50 

1.0 

2.0 

0.9 

0.911 

3.123 

0.913 

0.924 

5.636 

0.399 

0.466 

2.097 

0.006 

4.339 

9.036 

0.051 

0.91 

0.94 

0.92 

100 

1.0 

2.0 

0.9 

0.913 

2.684 

0.903 

1.223 

3.474 

0.417 

0.854 

1.385 

0.011 

3.445 

5.108 

0.042 

0.92 

0.96 

0.92 

200 

1.0 

2.0 

0.9 

1.033 

2.378 

0.893 

1.234 

2.369 

0.420 

0.964 

1.024 

0.011 

2.393 

3.437 

0.027 

0.92 

0.95 

0.93 

30 

1.0 

2.0 

0.1 

0.261 

2.972 

0.274 

0.962 

5.359 

1.006 

0.128 

0.998 

0.088 

6.823 

10.343 

6.991 

0.89 

0.93 

0.91 

50 

1.0 

2.0 

0.1 

0.214 

2.814 

0.228 

0.912 

4.528 

1.057 

0.133 

0.759 

0.089 

4.393 

5.565 

3.258 

0.89 

0.92 

0.91 

100 

1.0 

2.0 

0.1 

0.185 

2.556 

0.179 

0.824 

3.360 

1.103 

0.125 

0.462 

0.083 

2.599 

3.426 

2.167 

0.91 

0.94 

0.93 

200 

1.0 

2.0 

0.1 

0.155 

2.411 

0.117 

0.771 

2.829 

1.173 

0.107 

0.336 

0.076 

1.841 

2.406 

1.618 

0.92 

0.93 

0.93 









































Table 2; The average MLE’s, mean square errors, variance of estimations, the average value of Fisher information, and coverage probability 
based on MLE estimators for GG distribution with censored data_ 



Parameter 

AE 

MSE 

VS 

EE 

CP 

n 

/? 

7 

e 

f3 

7 

e 

/? 

7 

e 

/3 

7 

e 

/? 

7 

e 

/? 

7 

9 


0.5 

2.0 

0.9 

1.141 

2.536 

0.705 

1.613 

2.092 

0.132 

1.203 

1.807 

0.094 

16.115 

14.718 

5.692 

0.86 

0.93 

0.84 

m 

0.5 

2.0 

0.9 

0.898 

2.300 

0.778 

0.866 

1.431 

0.071 

0.709 

1.342 

0.056 

8.758 

6.669 

2.715 

0.88 

0.94 

0.87 

100 

0.5 

2.0 

0.9 

0.789 

2.062 

0.822 

0.651 

0.862 

0.040 

0.568 

0.859 

0.034 

7.289 

5.956 

2.069 

0.88 

0.95 

0.88 

200 

0.5 

2.0 

0.9 

0.670 

2.042 

0.856 

0.326 

0.633 

0.018 

0.297 

0.632 

0.016 

4.143 

5.705 

1.008 

0.91 

0.95 

0.90 


0.5 

2.0 

0.1 

0.399 

2.367 

0.232 

0.059 

0.568 

0.138 

0.049 

0.434 

0.121 

4.323 

8.031 

6.697 

0.91 

0.92 

0.90 

il 

0.5 

2.0 

0.1 

0.389 

2.343 

0.260 

0.061 

0.485 

0.154 

0.049 

0.368 

0.128 

3.002 

6.738 

4.966 

0.91 

0.92 

0.91 

100 

0.5 

2.0 

0.1 

0.386 

2.304 

0.293 

0.066 

0.422 

0.174 

0.053 

0.330 

0.137 

3.390 

4.996 

4.715 

0.89 

0.93 

0.89 

200 

0.5 

2.0 

0.1 

0.377 

2.316 

0.307 

0.063 

0.363 

0.179 

0.048 

0.263 

0.136 

1.195 

4.790 

3.744 

0.89 

0.95 

0.90 


1.0 

2.0 

0.9 

1.995 

2.849 

0.746 

4.086 

3.014 

0.091 

3.098 

2.294 

0.068 

16.988 

18.095 

3.081 

0.87 

0.90 

0.85 

il 

1.0 

2.0 

0.9 

1.667 

2.594 

0.799 

2.727 

2.201 

0.052 

2.284 

1.851 

0.042 

15.772 

17.097 

2.365 

0.86 

0.91 

0.84 

100 

1.0 

2.0 

0.9 

1.308 

2.208 

0.851 

1.428 

1.120 

0.025 

1.335 

1.078 

0.022 

13.458 

15.288 

1.799 

0.85 

0.94 

0.82 

200 

1.0 

2.0 

0.9 

1.191 

2.064 

0.871 

0.902 

0.552 

0.013 

0.866 

0.548 

0.012 

10.675 

9.313 

1.069 

0.88 

0.93 

0.87 


1.0 

2.0 

0.1 

0.616 

2.994 

0.367 

0.296 

2.448 

0.215 

0.149 

1.460 

0.144 

8.004 

9.443 

6.819 

0.84 

0.93 

0.84 

il 

1.0 

2.0 

0.1 

0.628 

2.882 

0.389 

0.308 

2.110 

0.232 

0.170 

1.333 

0.148 

6.905 

4.961 

5.584 

0.80 

0.95 

0.81 

100 

1.0 

2.0 

0.1 

0.630 

2.816 

0.407 

0.325 

1.762 

0.253 

0.188 

1.098 

0.159 

5.158 

4.707 

4.130 

0.74 

0.89 

0.75 

200 

1.0 

2.0 

0.1 

0.631 

2.723 

0.413 

0.326 

1.331 

0.260 

0.190 

0.809 

0.162 

5.089 

3.507 

3.174 

0.73 

0.91 

0.74 









































Table 3; The number of cases that the criteria value of fitted distribution is smaller than the 


criteria value of fitted GG distribution 



Parameter 

AIC 

AIGG 

BIG 

n 

/? 

7 

9 

Gompert 

z GP 

GB 

GL 

Gompert 

z GP 

GB 

GL 

Gompert: 

z GP 

GB 

GL 


0.5 

2.0 

0.9 

761 

81 

71 

457 

821 

81 

71 

457 

902 

81 

71 

457 

m 

0.5 

2.0 

0.9 

648 

95 

84 

419 

703 

95 

84 

419 

901 

95 

84 

419 

100 

0.5 

2.0 

0.9 

360 

60 

48 

375 

387 

60 

48 

375 

776 

60 

48 

375 

200 

0.5 

2.0 

0.9 

146 

39 

30 

363 

152 

39 

30 

363 

541 

39 

30 

363 


0.5 

2.0 

0.1 

945 

18 

24 

350 

959 

18 

24 

350 

978 

18 

24 

350 

il 

0.5 

2.0 

0.1 

933 

19 

41 

386 

946 

19 

41 

386 

986 

19 

41 

386 

100 

0.5 

2.0 

0.1 

917 

23 

52 

418 

924 

23 

52 

418 

990 

23 

52 

418 

200 

0.5 

2.0 

0.1 

894 

33 

73 

408 

899 

33 

73 

408 

988 

33 

73 

408 


1.0 

2.0 

0.9 

492 

123 

102 

460 

588 

123 

102 

460 

706 

123 

102 

460 

m 

1.0 

2.0 

0.9 

279 

139 

120 

397 

308 

139 

120 

397 

539 

139 

120 

397 

100 

1.0 

2.0 

0.9 

84 

112 

82 

363 

89 

112 

82 

363 

282 

112 

82 

363 

200 

1.0 

2.0 

0.9 

12 

61 

43 

328 

15 

61 

43 

328 

80 

61 

43 

328 


1.0 

2.0 

0.1 

965 

25 

28 

333 

973 

25 

28 

333 

984 

25 

28 

333 

m 

1.0 

2.0 

0.1 

954 

16 

32 

352 

965 

16 

32 

352 

997 

16 

32 

352 

100 

1.0 

2.0 

0.1 

954 

25 

64 

364 

958 

25 

64 

364 

992 

25 

64 

364 

200 

1.0 

2.0 

0.1 

921 

36 

64 

387 

927 

36 

64 

387 

994 

36 

64 

387 


Table 4; The number of cases that the criteria value of fitted distribution is smaller than the 
criteria value of fitted Gompertz distribution 



Parameter 

AIG 

AIGG 

BIG 

n 

P 

7 

9 

GG 

GP 

GB 

GL 

GG 

GP 

GB 

GL 

GG 

GP GB 

GL 


0.5 

2.0 

0.9 

54 

13 

6 

231 

34 

7 

5 

180 

16 

5 

1 

92 

il 

0.5 

2.0 

0.9 

71 

39 

21 

173 

58 

25 

14 

156 

20 

3 

1 

68 

100 

0.5 

2.0 

0.9 

59 

39 

34 

161 

53 

37 

32 

150 

12 

3 

4 

47 

200 

0.5 

2.0 

0.9 

68 

68 

69 

145 

65 

64 

62 

141 

4 

3 

1 

25 


0.5 

2.0 

0.1 

47 

8 

1 

108 

28 

3 

0 

81 

12 

0 

0 

44 

il 

0.5 

2.0 

0.1 

57 

6 

3 

122 

49 

5 

1 

no 

13 

0 

0 

42 

100 

0.5 

2.0 

0.1 

97 

13 

21 

116 

86 

10 

15 

115 

13 

0 

1 

25 

200 

0.5 

2.0 

0.1 

129 

35 

31 

95 

122 

35 

30 

95 

17 

1 

0 

20 


1.0 

2.0 

0.9 

52 

7 

3 

48 

39 

0 

0 

43 

18 

0 


29 

il 

1.0 

2.0 

0.9 

53 

20 

10 

37 

44 

10 

5 

34 

13 

0 


17 

100 

1.0 

2.0 

0.9 

74 

30 

40 

25 

68 

24 

32 

24 

10 

1 

0 

8 

200 

1.0 

2.0 

0.9 

76 

42 

55 

7 

71 

37 

52 

7 

5 

2 

1 

3 


1.0 

2.0 

0.1 

34 

0 

1 

93 

20 

0 

0 

79 

5 

0 

0 

51 

il 

1.0 

2.0 

0.1 

47 

3 

0 

88 

42 

2 

0 

73 

15 

0 

0 

26 

100 

1.0 

2.0 

0.1 

65 

0 

1 

100 

61 

0 

1 

89 

8 

0 

0 

21 

200 

1.0 

2.0 

0.1 

86 

4 

4 

91 

80 

4 

4 

86 

13 

0 

0 

8 






















































i) convergence has been achieved in all cases and this emphasizes the numerical stability of the 
EM-algorithm, ii) the differences between the average estimates and the true values are almost 
small, hi) the MSE, variance of estimations, and variance based on Fisher information matrices 
decrease when the sample size increases, iv) the coverage probabilities of the confidence intervals 
for the parameters based on asymptotic approach are satisfactory and especially are close to 
the confidence coefficient, 0.95 when the sample size large. 

In the second simulation, we consider the GG distribution and generate N = 1000 random 
samples with different set of parameters for n = 30,50,100,200 and censoring percentage 
p = 0.3. Using the censored likelihood function in (j5.5p . we obtained the MLE of parameters 
as well as the Fisher information matrix. Then, the AE, MSE, VS, EE matrices, and CP of the 
95% confidence interval are computed. The results are given in Table [2l and conclusions are 
similar to the first simulation. Only, the variances based the average value of Fisher information 
matrix are very large. 

At the end, we performed a simulation study directed to model misspecification. We 
consider the GG distribution and generate N = 1000 random samples with different set of 
parameters for n = 30, 50,100,200. In each sample, considered distributions (Gompertz, GG, 
GP, GB with m = 5, GL) were fitted. The MLE of parameters, and then AIC (Akaike 
Information Criterion), AICC (AIC with correction) and BIG (Bayesian Information Criterion) 
are calculated. Using each criteria (AIC, AICC, BIG), the preferred distribution is the one with 
the smaller value. We computed the cases that the Gompertz, GP, GB, and GL distributions 
were preferred with respect to GG distribution. The results are given in Table [3] and we can 
conclude that when the real model is GG distribution i) it is usually possible to discriminate 
between GG distribution and three subclasses of GPS (GP, GB and GL), ii) when the sample 
size is large and the parameter 9 far from away from 0, we can discriminate between GG 
distribution and Gompertz distribution. In fact, when 6 is close to 0, the GPS model becomes 
to the Gompertz distribution (See Proposition 2). 

Also, we study model misspecification using generating random sample from the Gompertz 
distribution and computed the cases that the GG, GP, GB, and GL distributions were preferred 
with respect to Gompertz distribution. The results are given in Table |4] and we can conclude 
that it is usually possible to discriminate between Gompertz distribution and the subclasses of 
GPS (GG, GP, GB and GL) when the real model is Gompertz distribution. 


7 A numerical example 


In this Section, we consider a real data set a nd fit the Gompertz, GG , GP, GB (with m = 5) 


and GL distributions. The data obtained from 


Smith and Navloii (jl987l ) represent the strengths 


of 1.5 cm glass hbres, measured at t 


also studied by 


Barreto-Souza et al 


le Na tional Physical Laboratory, England. This data is 


(j2O10|): 


0.55, 0.93, 1.25, 1.36, 1.49, 1.52, 1.58, 1.61, 1.64, 1.68, 1.73, 1.81, 2.00, 0.74, 1.04, 1.27, 

1.39, 1.49, 1.53, 1.59, 1.61, 1.66, 1.68, 1.76, 1.82, 2.01, 0.77, 1.11, 1.28, 1.42, 1.50, 1.54, 

1.60, 1.62, 1.66, 1.69, 1.76, 1.84, 2.24, 0.81, 1.13, 1.29, 1.48, 1.50, 1.55, 1.61, 1.62, 1.66, 

1.70, 1.77, 1.84, 0.84, 1.24, 1.30, 1.48, 1.51, 1.55, 1.61, 1.63, 1.67, 1.70, 1.78, 1.89. 

The MLE’s of the parameters (with standard errors) for the distributions are given in Table 
[5j Note that the MLE of 9 for GL distribution is very close to 0. Therefore, the MLE’s of the 
GL and Gompertz distributions are very close. In this table, we also consider the estimation of 
parameters for three p arameters Weibull distrib ution (TW) with the following density function 


which is considered by 


Smith and Navloj (|l987l l 


fTw{x) = A 7 (x — Oy ^ exp(—A(x — dy)^ x > 6 , A > 0, 7 > 0, 9 & R. 


We give the estimation of /? = log(A) for the TW distribution because the MLE of A is very 
close to 0 . 

To test the goodness-of-fit of the distributions, we calculated the maximized log-likelihood, 
the Kolmogorov-Smirnov (K-S) statistic with its respective p-value, the AIC, AIGG and BIG 
for the six distributions. The results show that the GG distribution yields the best fit among 
the TW, GP, GB, GL, and Gompertz distributions. Also, the GG, GP, and GB distribution 
are better than Gompertz and TW distributions. The plots of the densities (together with 
the data histogram) and cumulative distribution functions in Figure [3 confirm this conclusion. 
Also, Plots of the QQ-plot of fitted distributions are given in Figure [HI 
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Figure 7: Plots (density and distribution) of fitted Gompertz, GG, GP, GB, GL and TW 
distributions for the data set. 
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Figure 8: QQ plots of the Gompertz, GG, GP, GB, GL and TW models. 
























Table 5: Parameter estimates (with std.), K-S statistic, p-value, AIC, AICC and BIC for the 
data set. 


Dis. 

Gompertz 

GG 

CP 

GB 

GL 

TW 

$ (std.) 

0.0088 ( 0 . 001 ) 

0.8023 (0.772) 

0.0006 ( 0 . 001 ) 

0.0013 (0.001) 

0.0088 ( 0 . 011 ) 

-13.9192(-) 

7 (std.) 

3.6474 (0.069) 

1.3082 (0.586) 

4.4611 (0.566) 

4.2406 (0.404) 

3.6474 (0.593) 

11.8558 (9.795) 

9 (std.) 

— 

-58.8912 (91.83) 

5.5965 (3.224) 

1.8740 (1.268) 

0.0001 (1.310) 

-1.5934 (2.637) 

- log(L) 

14.8081 

12.2288 

12.8702 

13.0212 

14.8067 

14.2853 

K-S 

0.1268 

0.0962 

0.1207 

0.1217 

0.1267 

0.0001 

p-value 

0.2636 

0.6040 

0.3177 

0.3085 

0.2636 

0.9869 

AIC 

33.6162 

30.4576 

31.7404 

32.0424 

35.6134 

34.5712 

AICC 

33.8162 

30.8644 

32.1472 

32.4491 

36.0202 

34.9774 

BIC 

37.9025 

36.8870 

38.1698 

38.4718 

42.0428 

40.9999 


Appendix 

A. 

Here, we give the proof of Theorems 15.11 and 15.31 Consider pi = exp(—— 1)). 


A.l 


Let wi{l3;'y,e,x) = ^^=1 ^ ^ Er=i log(C"( 6 'pf)). Then, tci (/3; 7 , 6 », a;) is 

strictly increasing in /3 and 




X] (p*) ’ 


lim wi (/?; 7 , 9, x) = 0 . 
0—^00 


Therefore, 


lim gi (/3; 7 , 9, x) = 00 , lim gi (/3; 7 , 6 », a;) = V log (pi) < 0. 

/3-i-0+ /3^oo ^ 

Also, 

gi (/?; 7 , 0 ,ai) < ^ + ^log(pi), gi(^;7,0,®) > ^ + + 1 ^ ^log(pi)- 

Therefore, gi {/3-,'y,9,x) < 0 when ^ gi {/3;j,9,x) > 0 when | + 

Er=i loS (pO > 0- Hence, the proof is completed. 


A.2 


It can be easily shown that 


lim g 2 ( 7 ; /3, 6 », ai) = nx - | ^ x- ( 1 + 


7-S-0+ 


2=1 


9e-0^iC" {9e-P^^y 
C'{9e-P^i) 


lim g 2 ( 7 ;/ 3 , 6 ',x) =- 00 . 












Since the limits have different signs, the equation g 2 {'y] f3,6, x) = 0 has at least one root with 
respect to 7 for fixed values /? and 9. The proof is completed. 


A.3 

(i) For GP, it is clear that 


lim g 3 ( 6 »;/ 3 , 7 ,a;) 
e^o+ 


n 



i=l 


lim g 3 ( 6 ';/ 3 , 7 ,®) = - 00 . 

9—>-oo 


Therefore, the equation g 3 (0; /?, 7 , x) 

n 

> §• 

i=l 

(ii) For GG, it is clear that 


n 

0 has at least one root for 0 > 0 , if ^ — § > 0 or 

i=l 


lim g 3 ( 6 »;/ 3 , 7 ,a;) = - 00 , 
6^00 
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Therefore, the equation g 3 (0; (3, 7 , a;) = 0 has at least one root for 0 < 0 < 1, if —n + 2 ^ > 0 
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or E > f • 

i=\ 

(iii) For GL, it is clear that 


n 

lim g 3 {e-,l3,j,x) = lim g 3 {9-,/3,'y,x) = - 00 . 
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n 

Therefore, the equation g 3 {6;l3,'y,x) = 0 has at least one root for 0 < 0 < 1, if X] “ f > 0 

i=l 
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or E > f • 

i=l 

(iv) It is clear that 
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Then, the elements of 3 x 3 observed information matrix /n(©) are given by 
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